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Abstract 

We present an expression for the covariance matrix of the set of state vectors 
describing a track fitted with a Kalman filter. We demonstrate that this expression 
facilitates the use of a Kalman filter track model in a minimum algorithm for 
the alignment of tracking detectors. We also show that it allows to incorporate 
vertex constraints in such a procedure without refitting the tracks. 



1. Introduction 

Minimum;^^^ algorithms for the alignment of tracking detectors generally come 
in two flavours, namely those that ignore and those that do not ignore the corre- 
lations between hit residuals. The former are sometimes called local or iterative 
methods while the latter are called global or closed-form methods III]. The ad- 
vantage of the closed-form methods is that for an alignment problem in which the 
measurement model is a linear function of both track and alignment parameters 
the solution that minimizes the total can be obtained with a single pass over the 
data. 

The covariance matrix for the track parameters is an essential ingredient to 
the closed-form alignment approach [2]. If the track fit is performed using the 
standard expression for the least-squares estimator (sometimes called the standard 
or global fit method), the computation of the covariance matrix is a natural part of 
the track fit. This is why previously reported implementations of the closed-form 
alignment procedure (e.g. mm HS]) make use of the standard fit. 

In contrast most modem particle physics experiments rely on a Kalman filter 
track fit for default track reconstruction. The Kalman filter is less computa- 
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tionally expensive than the standard fit and facilitates an easy treatment of multiple 
scattering in the form of process noise. However, the computation of the covari- 
ance matrix in the common Kalman track fit is not complete: The correlations 
between track parameters at different position along the track are not calculated. 
In the presence of process noise these correlations are non-trivial. Consequently, 
the result of the common Kalman track fit cannot be used directly in a closed-form 
alignment procedure. 

In this paper we present the expressions for the computation of the global co- 
variance matrix — the covariance matrix for all parameters in the track model — 
in a Kalman filter track fit. We show how this result can be used in an alignment 
procedure. Furthermore, using similar expressions we demonstrate how vertex 
constraints can be applied in the alignment without refitting the tracks in the ver- 
tex. To illustrate that our approach leads to a functional closed-form alignment 
algorithm, we present some results obtained for the alignment of the LHCb vertex 
detector with Monte Carlo simulated data. 

An important motivation for extending the Kalman track fit for use in a closed- 
form alignment approach is that the estimation of alignment parameters is not in- 
dependent of the track model. Typically, in closed-form alignment procedures the 
track model used in the alignment is different from that used in the track recon- 
struction for physics analysis, which in practise is always a Kalman filter. Some- 
times the track model in the alignment is simplified, ignoring multiple scattering 
corrections or the magnetic field. The imperfections in the track model used for 
alignment will partially be absorbed in calibration parameters. Consequently, in 
order the guarantee consistency between track model and detector alignment, it is 
desirable to use the default track fit in the alignment procedure. 

The Kalman filter has also been proposed for the estimation of the alignment 
parameters themselves nlu\ . This method for alignment is an alternative formu- 
lation of the closed-form alignment approach that is particularly attractive if the 
number of alignment parameter is large. Our results for the global covariance ma- 
trix of the Kalman filter track model and for vertex constraints can eventually be 
applied in such a Kalman filter alignment procedure. 



2. Minimum formalism for alignment 

To show that the global covariance matrix of the track parameters is an es- 
sential ingredient to the closed-form alignment approach, we briefly revisit the 
minimum formalism for alignment. Consider a track defined as 

= [m-h{x)Y V-^ [m-hix)], (1) 
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where m is a vector of measured coordinates, V is a (usually diagonal) covariance 
matrix, h(x) is the measurement model and x is the vector of track parameters. 
Note that Eq{T]is a matrix expression: m and h are vectors and V is a symmetric 
matrix, all with dimension equal to the number of measurements. 

For a linear expansion of the measurement model around an initial estimate xq 
of the track parameters. 



where 



h(x) = h(Xo) + H(X - Xq), 

dh{x) 



H = 



dx 



is sometimes called the derivative or projection matrix, the condition that the 
be minimal with respect to x can be written as 

= ^ = -2H^V-^[m-hixo)-H{x- Xo)]. 
dx 

The solution to this system of equations is given by the well known expression for 
the least squares estimator 



X = Xq-CH^V ^[m-h(xQ)] 
where the matrix C is the covariance matrix for x 



(2) 



(3) 



If the measurement model is not linear, i.e. if H depends on x, expression Eqj2] 
can be applied iteratively, until a certain convergence criterion is met, for example 
defined by a minimum change in \hQx^- In that case it makes sense to write Eql2] 
in terms of the first and second derivative of the;^^^ at the current estimate xq 





\ 






\ dx^ 


xj 


dx 


XQ 



Xq — 



and regard the iterative minimization procedure as an application of the Newton- 
Raphson method. 

We now consider an extension of the measurement model with a set of cali- 
bration parameters a, 

h(x) — > h(x, a). 
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The parameters a are considered common to all tracks in a particular calibration 
sample. We estimate a by minimizing the sum of the values of the tracks 
simultaneously with respect to a and the track parameters Xi of each track i, 

^!£l = and V, ^ = 0. (4) 

oa oxi 

Please, note that the index i refers to the track and not to a component of the vector 
X. We will omit the index from now on and consider only the^^ contribution from 
a single track. 

The number of parameters in the minimization problem above scales with the 
number of tracks. If the number of tracks is large enough, a computation that uses 
an expression for the least squares estimator analogous to Eql2lis computationally 
too expensive. A more practical method relies on a computation in two steps. 
First, track parameters are estimated for an initial set of calibration parameters uq. 
Subsequently, the total is minimized with respect to a taking into account the 
dependence of Xi on a, e.g. through the total derivative 

& d ^ Ax d 
dof da da dx 

The derivative matrix dx/da in Eql5]follows from the condition that the;^^^ of 
the track remains minimal with respect to x, which can be expressed as 



(6) 



da dx 

and results in 

&x d^x^ ( d^X^ ^ 
Aa dadx \ dx^ 

Note that if the problem is linear this derivative is independent of the actual value 
of X or a. Consequently, in this limit this expression remains valid even if the 
track was not yet minimized with respect to x. 

The condition that the total x^ of a sample of tracks be minimal with respect 
to both track and alignment parameters can now be expressed as 

dy2 

= (7) 
dor 

For M alignment parameter this defines a system of M coupled non-linear equa- 
tions. In analogy with the procedure introduced for the track minimization 
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above we search for a solution by linearizing the minimum condition around 
an initial value ao and solving the linear system of M equations 



da^ 



da 

ao 



(8) 



ao 



for Aa. In the remainder of this section we derive the expressions for these deriva- 
tives. 

To simplify the notation we define the residual vector of the track 

r = m — h{x, a) 

and its derivative to a 

dai 

We linearize r around the expansion point (;c(ao),ao), and using Eq|6] obtain for 
any total derivative to a 

d d A^Y'^HC ^ 
da da dx 

(The minus sign appears because H is the derivative of h and not of r.) In this 
expression we have substituted the covariance matrix for C for x. The first and 
second derivatives of the;^^ contribution of a single track are now given by 

— = lA^y-Hv -HCH'^]v~^r, (9) 



da 

dV 
da2 



The matrix 



2A^V-^{y -HCH^)V~^A. (10) 



R = V-HCH^ (11) 



that appears in these expressions is the covariance matrix for the residuals r. This 
matrix is in general singular and its rank is the number of degrees of freedom of 
the fit. 

If the track parameters x for which the residuals r and H are calculated, are 
actually those that minimize the track's for the current set of alignment con- 
stants ao, the residuals satisfy the least squares condition H^V~^r = and the first 
derivative to a reduces to ^ 

— = 2A^y-V. (12) 
da 
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Consequently, if V is diagonal, the derivative to a particular parameter aj only 
receives contributions from residuals for which drj/daj does not vanishQ An im- 
portant consequence of this is that if there are additional contributions to the tracks 
X^, in particular hits in subdetectors that we do not align for, constraints from a 
vertex fit or multiple scattering terms, then these terms only enter the derivative 
calculation through the track covariance matrix C. We will exploit this property 
in the next section when we discuss the use of a Kalman filter track model for 
alignment. 

The expressions Eq{T2land Eq{TO]can now be used to evaluate the first and sec- 
ond derivative for an initial calibration ao over a given track sample and inserted 
in Eql8]to obtain an improved calibration. If the residuals are non-linear in either 
track parameters or alignment parameters, several iterations may be necessary to 
minimize the x^- 

If the alignment is sufficiently constrained, the second derivative matrix can 
be inverted and the covariance matrix for the alignment parameters is given by 



Cov(Qr) = 2 



I da2 



Ignoring higher order derivatives in a, the change in the total as the result of a 
change Aor in the alignment parameters can be written as 

1 d 2^ 

AX^ = - — Aa = -Aa^ Cov(a)-' Aa. 

2 da 

Consequently, the change in the total x^ is equivalent to the significance of the 
alignment correction. The quantity Ax^ is a useful measure for following the 
convergence of an alignment. 



3. The global covariance matrix in the Kalman filter track fit 

In the global method for track fitting a track is modelled by a single A'^ param- 
eter vector (usually = 5) at a fixed position along the track. Multiple scattering 
can be incorporated in this model by introducing explicit parameters for the kinks 
at scattering planes. The parameters that minimize the;^'^ and the corresponding 
covariance matrix follow from the application of the least squares estimator Eql2l 



In other words, if a, is an alignment parameter of module X, only hits in module X contribute 
to the first derivative of the;\f^ to a, . 
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In the Kalman filter method for track fitting jlOll a track is modelled by a sep- 
arate dimensional track parameter vector (or state vector) at each measurement 
(or node). The state vectors are related by a transport function, which follows 
from the equation of motion of the charged particle. In the absence of multiple 
scattering the state vectors are one-to-one functions of one-another and hence fully 
correlated. In the presence of multiple scattering the correlation is reduced by in- 
troducing so-called process noise in the propagation of the state vector between 
neighbouring nodes. 

As we have seen in the previous section the closed-form method for align- 
ment uses the vector of residuals r and a corresponding covariance matrix R. The 
covariance matrix for the residuals can be computed from the global covariance 
matrix of the track parameters. However, the correlations between the state vec- 
tors at different nodes are normally not calculated in the Kalman filter: They are 
either not computed at all (if the smoothing is done as a weighted average of a 
forward and backward filter) or (if the Rauch-Tung-Striebel smoother formalism 
is applied) only implicitly and only between neighbouring nodes. 

To derive an expression for the covariance matrix of all parameters in the 
Kalman filter track model we use the notation of reference 1110(1 for the linear 
Kalman filter, in particular 



Xk is the state vector at node k after accumulating the information from 
measurements {1, . . . , A;}; 

• Ck is the covariance of Xk\ 

• x1 is the state vector at node k after processing all n measurements. 

In the following we first calculate the correlation between Xk-i and Xk, which 
we denote by Ck-i,k- From this we proceed with the correlation matrix between 
j and xl- The correlation between any two states k and I then follows from 
the observation that the correlation between these states occurs via intermediate 
states. 

In the notation of we have for the prediction of state k from state k-l, 

k-l _ 77 

Xk - rk-\Xk-\, 

where F is the Jacobian or transport matrix. The covariance of the prediction is 
given by 

Cl ' = Fi_iCi_iFj_j -I- Qf,-i 
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where Qi,^i is the process noise in the transition from state k - I to k. The full 
covariance matrix for the pair of states (Xk-i, xl'^) is then given by 

\ t^k-\^k-\ k-\^k-\t^ + \J.k-l 

In the Kalman filter track fit we now proceed by adding the information of mea- 
surement k to obtain a new estimate for the state in procedure that is called fil- 
tering and leads to state vector xu- The remaining measurements + 1, . . . , n are 
processed with prediction and filter steps in the same fashion. Afterwards a pro- 
cedure called smoothing can be applied to recursively propagate the information 
obtained through measurements k+ I,. . .,n back to node k. The smoothed state 
vector at node k is labelled by x^ and its covariance by C'l- To derive the expres- 
sion for the covariance matrix of the smoothed states x^. and j we first present 
the following lemma. 

Suppose we have two observables a and b with covariance matrix 

Vba Vhb 

Now suppose we have obtained a new estimate of a with variance Vaa by adding 
information. We can propagate the new information to b with a least squares 
estimator, which gives 

b = b + VhaV-l(a-a) 

Vbb = Vtb + VhaV;^\Vaa-Vaa)V;^'Vab (13) 
Vab = Vab + (Vaa-Vaa)V;^'Vab = VaaV-^b 

This expression also holds if a and b are vectors. It can be derived by minimizing 
the following 



I a 




' Vaa 


Vab ' 




- a\ 


u 






Vbb , 







+ (a - nif Vm ^ {a - m) 



(where m with variance is the additional information for a) with respect to a 
and b. 

Substituting Xk-\ for b, Xk for a and x^. for a in Eq{T3]we obtain for the corre- 
lation between the smoothed states 

Clu = C,.,FU(cl-'y' C'l = A,^,Cl (14) 
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where we have used the definition of the smoother gain matrix [11011 

A,_i = C,.^Fl,(cl-'y\ (15) 
For the smoothed state ^ and its covariance we find 



T 

k-l- 



These are the Rauch-Tung-Striebel smoothing expressions as found in [|10|1 . The 
gain matrix in Eq{T5]can be written in different forms, e.g. 



This expression shows explicitly that A<^_i = (Fk-i)~^ if there is no process noise 
(Q = 0). Therefore, as one expects, without process noise the smoothed states in 
the Kalman filter are just related by the transport equation. 

Once we have the calculated the off-diagonal element (k - l,k), we proceed 
to the next diagonal (k - 2, k). The correlation between states k - 2 and k can be 
calculated by performing a simultaneous smoothing of states k - 2 and k-l: In 
the argument above we substitute a new vector (xk-2,Xk-\) for b, rather than just 
the state x^-i . The result for the correlation (k -2,k)is 



*^k-2,k - ^k-2,k-\ \^k-\) ^k- 



\,k- 



This expression can also be derived with a simpler argument: The origin of the 
correlation between state vectors is the transport. Therefore, the correlation {k - 
2, k) occurs only through the correlations (k - 2,k - 1) and (k - l,k). Following 
the same reasoning the next diagonal becomes 



/~'n f^n I (-'n \ ^ f-'n ( f-'n \ ^ f-'n 

^k-\k ~ ^ k-Xk-2\^ k-2) ^k-2,k-\\^k'\) ^k- 

f-'n ( r^n \ ^ r^n 

~ ^ k-Xk-2\^ k-2) 



2,k' 



which shows that the calculation can be performed recursively. By substituting 
the smoother gain matrix we can write this in the following compact form 

Clu = C'UkiC'D''Cl, = A,.,Cl, k<L (16) 

If the gain matrices are temporarily stored, then if is the dimension of the state 
vector, the calculation of each off-diagonal element in the full covariance matrix of 
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state vectors requires about A'^ multiplications and A'^ additions. The total number 
of operations by far exceeds the numerical complication of the standard Kalman 
filter. However, we have found that for tracks traversing the entire LHCb tracking 
system, with a total of about 30 measurement coordinates, the computational cost 
for the global covariance matrix with the procedure above was smaller than that 
of the Kalman filter track fit itself. This is because the LHCb track fit is largely 
dominated by integration of the inhomogeneous magnetic field and the location 
of intersections with detector material. 

Now that we have calculated the full covariance matrix of all states the 
elements of the covariance matrix R of the residuals are simply given by 

Rke = VkSki - HkCl fHj . (17) 

This completes the recipe for using a Kalman filter track model in the alignment 
of tracking detectors. 

We have argued below Eq[T2lthat the cancellation that takes place between Eqj9] 
and Eq{T2lis important when considering the Kalman track fit for alignment. This 
can be explained as follows. If we were to use the Kalman filter track model in a 
global fit, the would contain explicit contributions for the diff"erence in the 
state vectors at neighbouring nodes, 

xl-i,k = (xk - Fk-iXk-if (Qk-iT^ (xk - Fk-iXk-i) . 

These contributions are equivalent to the terms that constrain scattering angles in 
the conventional track model for a global track fit. As they represent additional 
constraints to the;^^, they must also appear in the matrix V and the residual vector 
r in Eqj9l It is only because of the minimum condition for the track parameters 
that their contribution in the derivatives to the alignment parameters vanishes. 

4. Vertex constraints 

The expressions in Eq{T3]can also be used to include vertex or mass constraints 
in an alignment procedure. First, we propagate the track parameters to the esti- 
mated position of the vertex. We label the track parameters at that position with 
x") and its covariance by CJ,. The correlations between these track parameters and 
those at the position of each measurement can be computed with the procedure 
outlined in the previous section. 

For clarity we now drop the superscript n and replace it with a superscript 
(i) that labels the track in the vertex: The state of track i at the vertex is 
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with CO variance C[^. As a result of the vertex fit (which we can implement as 



the Billoir-Friihwirth-Regler algorithm [12LllO|]) we obtain the new 'constrained' 
track parameters x'q with co variance C|)\ The change in the track parameters can 
be propagated to the track states at each measurement using Eq[T3l which gives 
for the state vector at node k 

= -r + c«(csf (18) 

and for the covariance 

/^(i) _ Mi) , Mi) ( Mi)Y^ ( r^'^ — r^'h ( r^'A~^ r^'^ ^^Q^ 

The constrained residuals for track j then become 

rf = mf -/i(xf) (20) 

and the covariance matrix R in Eq|l7]can be computed using the new track state 
covariance Cf^. 

The vertex fit also gives us the covariance C^''^^ between any two tracks / and 
j in the vertex. This allows to compute the correlation between any two states in 
any two tracks as follows 

MiS _ Mi) (Mi)\-^ Mi,j) (f^m-"^ Mi) /^iN 
^k,i ~ ^kfi y--o I ^0 V'^O ) ^^^^ 

Inserting this into the multi-track equivalent of Eq[l7] gives the full correlation 
matrix for the residuals on all tracks. If the number of tracks in the vertex is 
large, the computation of the global covariance matrix for all states on all tracks 
is rather CPU time consuming. Therefore, in practical applications it makes sense 
to compute the correlation only for a subset of hits close to the vertex. 

This completes the ingredients for including vertex constraints in the calcu- 
lation of the alignment derivatives. Eventual mass constraints or other kinematic 
constraints are included implicitly if they are applied during the vertex fit. 



5. Application to the alignment of the LHCb tracking system 

The LHCb tracking system consists of a silicon vertex detector (VELO) and a 
spectrometer iflsil . For the track based alignment of this system a closed-form 
alignment algorithm has been implemented in the LHCb software framework. 
This algorithm, which uses the standard LHCb Kalman filter track fit, will be 
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described in detail in a future publication [|14l1 . Here we briefly illustrate the eff'ect 
of correlations between residuals and the applications of vertex constraints, using 
the alignment of the VELO system as an example^] 

The VELO system consists of 21 layers of double sided silicon detectors with 
radial strips on one side and concentric circular strips on the other. Each layer 
consist of two half circular disks called modules. The modules are mounted onto 
two separate support structures, the left and right VELO halves. The two halves 
can be moved independently in the direction perpendicular to the beam (z) axis in 
order to ensure the safety of the detectors during beam injection. The alignment 
of the VELO system is of crucial importance to the physics performance of the 
LHCb experiment. 

For the analysis described here we have simulated deformations in the VELO 
detector in such a way that it bows along the z-axis: we introduced a bias in 
the X and y position of each module that was approximately proportional to (z - 
Zo)^, where z- Zo was the z position of the module relative to the middle of the 
VELO. The reason to choose this particular misalignment is that it corresponds 
to a correlated movement of detector elements: Such deformations — sometimes 
called 'weak modes' — are inherently difficult to correct for with an alignment 
method that ignores the correlations between residuals in the track fit. 

Tracks from a sample of simulated minimum bias interactions were recon- 
structed using a 'cheated' pattern recognition, assigning VELO hits to a track 
based on the Monte Carlo truth. We required at least 8 hits per track. The tracks 
were fitted with the standard LHCb track fit, taking scattering corrections into ac- 
count as process noise. Tracks were accepted for alignment if their per degree 
of freedom was less than 20. In a perfectly aligned detector this cut only excludes 
a tiny fraction of reconstructed tracks, namely those with a kink due to a hadronic 
interaction. Primary vertices were reconstructed using the standard LHCb primary 
vertex finder. 

To validate the implementation of the algorithm we have performed two tests. 
First, we have checked the calculation of the residual covariance matrix R by com- 
paring it to a numerical computation. A single track was refitted after changing 
one measurement coordinate m, by a numerically small value. The i-th row of R 



^An alternative algorithm for the VELO alignment is described in ||8|]. It uses a standalone 
global track fit with a straight line track model, suitable only in the field-free region of the detector. 
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Figure 1: Logarithm of the eigenvalue versus eigenvalue index ('mode') in the alignment of the 
position and rotation of two halves of the LHCb VELO. 



follows from (for j i) 

D _ J 



where (5r^.'^ is the change in residual rj. (The computation of the diagonal element 
Rii is part of the standard Kalman fit procedure.) This test has shown that the 
numerical uncertainty in the correlations coefficients of R is typically of order 
lO"'*, which is good enough for the purpose of detector alignment. 

Second, we have analyzed the eigenvalue spectrum of the second derivative 
matrix EqUOl Without an external reference system the global translations and 
rotations of a tracking system are unconstrained in the alignment procedure. Such 
unconstrained degrees of freedom lead to vanishing eigenvalues in the derivative 
matrix and, if left untreated, result in a poorly converging alignment. (See e.g. [HI].) 
Unconstrained degrees of freedom can be removed with Lagrange constraints or 
by omitting the corresponding eigenvector from the solution to the linear system 
in EqjSl However, to test the implementation of the calculations in the global 
alignment algorithm, the identification of the vanishing eigenvalues is a powerful 
tool: If the zero eigenvalues corresponding to the global movements are observed, 
we can be confident that the computation of both the matrix R and the alignment 
derivatives dr/da is correct (or at least consistently wrong). 

Figure \T\ shows the eigenvalues for the alignment of the position and rotation 
of the two VELO halves. (The eigenvalues are plotted versus an arbitrary index 
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Figure 2: Logarithm of the eigenvalue versus eigenvalue index ('mode') in the alignment of the 
position of modules in the LHCb VELO. 



that increases with the size of the eigenvalue.) The total number of alignment 
parameters is 12. To define the scale of the eigenvalues the derivative matrix 
was rescaled following the recipe in ^6*]: the numerical value of the eigenvalue 
is roughly equal to the number of hits contributing to the corresponding linear 
combination of alignment parameters. As can be seen in the figure the eigenvalue 
distribution splits in two: The six smaller eigenvalues correspond to the global ro- 
tation and translation, whereas the six larger eigenvalues correspond to the relative 
alignment of the two detector halves. Note that if correlations between residuals 
are ignored, the linear equations Eql8]split in independent parts for the two aligned 
objects and all eigenvalues are of about the same size. 

One may wonder why the eigenvalues corresponding to the global movements 
are not 'numerically' zero, in contrast with the analysis reported in The reason 
for this is a feature of the Kalman filter: In the Kalman filter the state vector is 
seeded with a finite variance even before a single measurement is processed. The 
variance must be large enough to have negligible weight in the variance of the 
state vector after all measurements are processed, but it must be small enough to 
make the computation of the filter gain matrix numerically stable. The finite value 
of the seed variance essentially fixes the track in space. We have observed that the 
value of the small eigenvalues is indeed sensitive to the variance of the seed. For 
practical purposes the bias from the Kalman filter seed is not important. 

To test the alignment procedure for the misalignment scenario presented above 
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Figure 3: Number of selected tracks (right) and average per track (left) as a function of the 
number of alignment iterations for 3 alignment scenarios, namely ignoring correlations between 
residuals, not ignoring those correlations and including vertex constraints. The dashed line repre- 
sents the result for a perfectly aligned detector 

we aligned the position of each module in x and y, corresponding to a total of 84 
alignment parameters. We omitted the z translation and rotations to simplify the 
analysis. The eigenvalue distribution, shown in figure El reveals 4 unconstrained 
degrees of freedom. These correspond to the global translation in x and y and — 
originating from the planar geometry of the detector — shearings in the xz and yz 
plane. We constrain these degrees of freedom with Lagrange constraints. 

We report here two figures of merit that we use to judge the convergence of 
the alignment procedure, namely the number of selected tracks and the average 

of selected tracks, both as a function of the alignment iteration. The results 
are shown in figure |3] for 3 different scenarios: First, we entirely ignore corre- 
lations between residuals, which means that off"-diagonal elements in the matrix 
R in equation Eq{TT] are assumed zero. Second, we compute these correlations 
with the recipe outlined in section |3l Finally, we also include vertex constraints 
with the expressions given in section HI As can be seen in the figure the scenario 
with correlations converges faster than the scenario without. Furthermore, in the 
scenario without correlations less tracks survive the;^^ cut even after 5 iterations. 

The diff'erence in convergence behaviour is mostly because there are two kinds 
of tracks. Though most tracks pass only through a single VELO half, there is 
a small fraction that passes through small regions in which detectors from both 
halves overlap. When correlations between hit residuals are taken into account. 
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tracks that pass through a single half do not carry any weight in determining the 
relative positions of the two halves, because the contribution to thex^ is invariant 
to the global position of the detector half. Therefore, the relative position of the 
two halves is fully sensitive to the tracks that pass through both halves. On the 
contrary, if correlations are ignored, every track fixes the position of any detector 
element in space. As a result the overlap tracks get a much smaller weight in 
determining the relative position and convergence becomes poor. 

This problem can be partially overcome by explicitly enhancing the fraction 
of overlap tracks in the sample, e.g. by down-sampling the tracks that do not 
pass through the overlap regions. Such a strategy is applied in the alignment 
of the Babar vertex detector llisll . An important advantage of the closed-form 
algorithm is that it is not necessary to remove tracks with a small weight in the 
alignment as the algorithm inherently weights the information contained in the 
residuals correctly. 

6. Conclusions 

In this paper we have presented how the most popular track fitting method, 
the Kalman filter, can be used in a closed-form alignment procedure for track- 
ing detectors. Our contribution is summarized in expression Eg fT6l which shows 
how the correlations between state vectors can be computed recursively by using 
the smoother gain matrix. We have also shown how vertex constraints can be in- 
cluded without refitting the tracks. Using an implementation of this formalism in 
the LHCb software framework we have illustrated for a simple misalignment sce- 
nario of the LHCb vertex detector the importance of correlations between residu- 
als in the track fit. A more detailed analysis of the performance of the alignment 
algorithm to the LHCb tracking system will be reported in due course (l4\ . 
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